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ABSTRACT 

Context. 

Aims. Our purpose is to find which is the most reliable one among various oxygen abundance determination methods. 
Methods. We will test the validity of several different oxygen abundance determination methods using methods of modem statistics. 
These methods include Bayesian analysis and information scoring. We will analyze a sample of ~6000 Hii galaxies from the Sloan 
Digital Sky Survey (SDSS) spectroscopic observations data release four. 

Results. All methods that we used drew the same conclusion that the Tg method is a more reliable oxygen abundance determination 
methods than the Bayesian metallcity method under the existing telescope ability. The ratios of the likelihoods between the different 
kinds of methods tell us that the Tg, P, and 03N2 methods are consistent with each other because the P and 03N2 method are 
calibrated by r^-method. The Bayesian and R23 method are consistent with each other because both are calibrated by a galaxy model. 
In either case, the N2 method is an unreliable method. 

Key words, galaxies: abundance - galaxies: starburst - stars: formation 
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I ■ 1. Introduction 

> 

— Hii galaxies with strong emission lines are important probes 

00 for the formation and evolution of galaxies. Their spectra 
contain much important information needed to determine the 

Tj" star formation rate, initial mass function, element abundance, 
etc. (Stasiiiska & Leitherer 1996; Kennicutt 1998; Contini 
etal. 2002). The heavy element abundance is a key parameter 

f — for the formation and evolution of a galaxy. Oxygen is one of 
the most important elements and is most easily and reliably de- 
ILJ termined since the most important ionization stages can all be 

• observed. 

There are various methods for determining the oxygen abun- 
?H dance for the Hii galaxies based on the strong emission lines. 

5^ , The oxygen abundance by the measurement of electron temper- 
ature from [Oiii].i/i4959,5007/[Oiii]/l4363 is one of the most re- 
liable methods. Tremonti et al. (2004) provided Bayesian abun- 
dances statistically by the MPA/JHU group Q, based on simul- 
taneous fits of all the most prominent emission lines ([On], 
Ha, [Om], Hj8, [Nii], [Sii]) with a model designed for the in- 
terpretation of integrated galaxy spectra (Chariot & Longhetti 
2001). Instead of the Tg method and Bayesian method, strong 
line methods such as the /?23Q ^"0, A^2Qor 03 A^2[3 methods are 
used widely (Pagel etal. 1979; Kobulnicky etal. 1999; Pilyugin 
etal. 2001; Chariot & Longhetti 2001; Denicolo etal. 2002; 
Pettini & Pagel 2004; Tremonti etal. 2004; Liang etal. 2006). 
Among these methods, which one is most rehable is an unsolved 
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2 /e23=([0 ii].13727-i-[0 iii]/l/l4959,5007)/I^e 

3 /'=[Om]^^4959,5007/([O ii]/l3727-i-[0 iii]/l/l4959,5007) 
" A'2=log([Nii]^6583/Hci') 

^ 03A'2=log(([0 iii]/l5007/He)/([Nii].l6583/HQ')) 



problem and debated broadly by many investigators (Stasiriska 
2005; Pilyugin & Thuan 2005; Shi etal. 2005, 2006, 2007; 
Kobulnicky etal. 1999). So it is necessary to study this issue 
again and find the most reliable oxygen abundance determina- 
tion method. 

The judgment of which method is the best one depends on 
how well a model agrees with the data. If one wants to fit the 
data better, expanding the set of free parameters in the model is 
needed. Thus, adding more free parameters will improve the fit, 
but make the model more complex. A more complex model is 
unsatisfactory compared to a simple model if the two models fit 
the data equally well. Therefore, one must decide what improve- 
ment of accuracy in the fit and the penalty paid by introducing 
the new parameter. 

Such consideration forms the spirit of Occam's razor 0. A 
quantitative formulation should combine the goodness of fit with 
a penalty function measuring the complexity of the theory, or di- 
rectly measure the credibility of the model rather than the fre- 
quency of occurrence as in the classical approach. Bayesian sta- 
tistical analysis and information scored statistical analysis try 
to find the most parsimonious models that adequately fit the 
data. These approaches grow from the requirements mandated 
by Occam's razor. Under Bayesian statistical methods, this per- 
mits us to assign a posterior probability for the validity of a phys- 
ical model, in our case of the method of calculating the oxygen 
abundance. A method is preferred when its posterior probabiUty 
exceeds that of any other competing method. 

The Sloan Digital Sky Survey (SDSS) is the most ambitious 
imaging and spectroscopic survey to date, and will eventually 



* Occam's razor is a principle attributed to the 14th-century English 
logician and Franciscan friar William of Ockham. Occam's razor is of- 
ten paraphrased as "All things being equal, the simplest solution tends 
to be the best one." 
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cover a quarter of the sky (York et al. 2000). The large area cov- 
erage and moderately deep survey limit of the SDSS make it suit- 
able for studying the physical properties of the galaxy. Because 
of its good homogeneity, the SDSS provides a large sample of 
Hii galaxies where oxygen abundance can be calculated with 
the various methods. 

This paper is organized as follows: based on a SDSS DR4 
starbursts spectral sample, we present a sample which we can 
use to calculate the oxygen abundance with various methods 
(Sect|2]l. In Sect. [3] we calculate the oxygen abundance with var- 
ious methods. In Sect. [H we measure the credibility of various 
oxygen abundance determination methods using Bayesian anal- 
ysis and information scoring. In Sect.|5j we discuss the reliability 
of various oxygen abundance determination methods, and con- 
clude this paper. 

2. Data sample 

We have used H n galaxies from the Fourth Data Release (DR4) 
of the SDSS. After subtracting the underlying starlight using 
the method of Li et al. (2005) and Lu et al. (2006), we fit the 
emission lines using the method of Dong et al. (2005). We made 
the internal reddening correction for the flux of all the emission 
lines, using the two strongest Balmer lines, Ha/Hfi and the ef- 
fective absorption curve - Tv(/i/5500A)"*'-^, which was in- 
troduced by Chariot & Fall (2000). Then, we made use of the 
spectral diagnostic diagrams from Kauff'mann et al. (2003) to 
classify galaxies as either starburst galaxies, active galactic nu- 
clei (AGN), or unclassified. To reduce systematic and random 
errors from aperture effects, our galaxy sample is limited by the 
requirement of redshift z > 0.04 (Kewley et al. 2005). 

Within the primary starburst sample, two subsamples were 
selected from the SDSS-DR4 with the fluxes of spectral lines 
for all [On]^3727, H/3, [Oiii]^4959, [Om]/15007, Ha and 
[Nn]/16583 higher than the flux uncertainty. The diff'erence be- 
tween these two subsamples is that the first subsample (Sample 
I) is selected by the additional criterion from the [Om]/i4363 
line, that the flux uncertainties for [Oiii]/14363 are higher than 
Icr. 409 galaxies were included in this subsample. [Om]/i4363 
is strongly dependent on the metallicity of galaxies; it becomes 
undetectable in high metallicity galaxies. Therefore, galaxies in 
Sample I are low metallicity galaxies. In the second subsample 
(Sample II), galaxies have weak or no [Om]/i4363 line. 5880 
galaxies were selected in this subsample. The average uncer- 
tainty in flux measurement in the computed 12-1- log(0/H) val- 
ues is typically 0. 10 dex in both of Sample I and Sample II. 

3. Determination of oxygen abundance 

3.1. Te method 

To derive oxygen abundances with the Te method, we de- 
termined Te and for a two-zone photoionized Hii region 
model. It is well established that for low metallicity galax- 
ies, 12 + log(0/H) < 8.2, [Oiii]/l4363 is prominent and can 
be measured accurately, while for high metallicity galaxies, 
[Om]/t4363 is weak and the error of its measurement is large. 
We use an N2 indicator to distinguish high metallicity regions 
from low metallicity regions (see Sect. 13. 4t . For low metallicity 
galaxies (12 h- log(0/H)^2 < 8.2), we used a five-level statis- 
tical equilibrium model in the IRAF NEBULAR package (de 
Robertis, Dufour, & Hunt 1987; Shaw & Dufour 1995), which 
made use of the latest collision strengths and radiative transition 
probabilities to determine the Te and n^. For high metallicity 



galaxies(12 + log(0/H)^2 > 8.2), an empirical relation of Te 
and strong spectral lines has been adopted for the electron tem- 
perature determination (Pilyugin 2001). The temperature will be 
used to derive the O^^ ionic abundances. 

To estimate the temperature in the low-temperature zone 
r,.(OII), the relation between Te{OlT) and r^(OIII) from 
Garnett (1992) was utilized : 



aOII) = 0.7x^0111) -H 0.3, 



(1) 



where te-T^ /IQ'^ K. The temperature re(OII) is used to derive 
the ionic abundance. 

After calculation of Te and He for the high-temperature zone 
and low-temperature zone, we used the expressions from Izotov 
etal. (2006) to calculate the oxygen abundance for these two 
zones. Then we simply sum and O^^ as our final oxygen 
abundance. 



3.2. R23 method 

Te metallicity determination requires the accurate measurement 
of the weak auroral forbidden emission line [Om]/i4363. The 
flux intensity of [Oiii]/l4363 strongly anticorrelates with the 
abundance of galaxies. Its flux intensity becomes undetectable 
in high metallicity galaxies (12 + log(0/H) > 8.2). 

For this reason, the strong line metallicity indicator R23 has 
been developed since Pagel et al. (1979) introduced it for the first 
time. We use the most recent R23 analytical calibrations given 
by Kobulnicky et al. (1999) which are based on the models by 
McGaugh (1991) to determine the oxygen abundances in our 
sample. 

The major difficulty associated with this method is that 
the relation between oxygen abundance and R23 is double val- 
ued, requiring some assumption or rough a priori knowledge 
of a galaxy's metallicity in order to locate it on the appro- 
priate branch of the curve. In this work, the [Nii]/16583/Ha' 
line ratio will be used to break the degeneracy of the R23 
relation (Denicolo et al. 2002). The division between the up- 
per and the lower branch of the R23 relation occurs around 
log([Nii]/l6583/HQ') ^ -1.26 (12 + log(0/H)^ 8.2). 

3.3. P method 

The R23 method was used widely but R23 abundances were 
found to be systematically larger than the Te metallicity. Pilyugin 
(2000, 2001) found that its error had two parts: a random error 
and a systematic error. The origin of this systematic error is the 
dependence of the oxygen emission lines on not only the oxy- 
gen abundance, but also on the other physical conditions (hard- 
ness of the ionizing radiation and a geometric factor). Pilyugin 
(2000, 2001) introduced the P method, another strong line met- 
alloid indicator to overcome these problems. We use the expres- 
sion of Pilyugin (2001) to calculate the abundance of oxygen 
in high metallicity regions (log([Nii]/l6583/Ha) > -1.26, or 
12 + log(0/H)> 8.2): 

, .^,TT^ ;;23 +54.2 -H59.45P -1-7.3 1P2 

12 + log(0/H)p = . (2) 

® 6.07H-6.71P-i-0.37p2 + o.243/?23 

and in low metallicity regions (log([Nii]/l6583/Ha') < -1.60, or 
12 + log(0/H)< 7.95): 

12 -I- log(0/H) = 6.35 -I- 1.45 log /?3 - 3.19 log P, (3) 

where R3 = /([Oiii]i/l4959,5007)//(H/3), and P = R3IR23. 
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3.4. N2 method 

Both /?23 and P metallicity are double valued. It is instructive 
to use one metallicity indicator to describe the whole metallicity 
with a single slope. The N2 = log[/([Nn]^6583)//(HQ')] index 
was found to fulfill this requirement by Denicolo et al. (2002). A 
least squares fit to the data simultaneously minimizing the errors 
in both axes gives 

12 + log(0/H) = 9.12 + 0.73 x N2. (4) 

The N2 indicator has advantages superior to the other metal- 
licity indicators. The N2 vs. metallicity relation is monotonic, 
and the N2 line ratio does not depend on reddening corrections 
or flux calibration. These advantages make N2 indicators able 
to break the degeneracy of the /?23-(0/H) (in Sect. 3.2) and the 
P-(0/H) (in Sect. 3.3) relation. 



by problems both with models of HII regions and calibra- 
tion. When comparing the numerical HII region models from 
Chariot & Longhetti (2001), which are at the base of the 
Bayesian abundances, with numerical models of other authors 
(Stasiriska & Leitherer (1996), McGaugh's model (McGaugh 
1991), CLOUDY (Ferland etal. 1998), or Kewley & Dopita 
(2002)), one will find that there is significant disagreement be- 
tween those models. It is because the stellar evolutionary syn- 
thesis code and photoionization code used in these models are 
in continuous progress and improved. It is also a result of using 
different atomic data or different assumptions in these models. 
The significant disagreement between those models prohibits the 
present-day models from providing uniform oxygen abundances. 

To help resolve this issue, one has to evaluate which is the 
most reliable oxygen abundance determination method to use 
when studying the metallicity of a galaxy. This will be the focus 
of the next section. 



3.5. 03N2 method 

03N2 = log {[/([Oiii]/15007)//(H/3)] / [/([Nn]/l6583)//(Ha)]) 
is another indicator that is monotonic. It was introduced by 
Afloin etal. (1979) and further studied by Pettini & Pagel 
(2004). Pettini & Pagel (2004) found that at C>3A^2 < 1.9, there 
appears to be a relatively tight, linear and steep relationship be- 
tween 03N2 and log (O/H). A least squares linear fit to the data 
in the range -1 < 03N2 < 1.9 yields the relation: 



12 + log (O/H) = 8.73 - 0.32 x 03A^2. 



(5) 



We use this expression to calculate the 03N2 oxygen abundance. 



3.6. Bayesian method 

It should be noted that the strong line methods, such as R^i, 
N2, P, 03N2, are empirical. The method based on physical 
model should be preferred rather than the empirical method. 
Besides the classic Tg method, the Bayesian method is a good 
method based on a physical model. This method was proposed 
by Tremonti et al. (2004). The Bayesian method is based on si- 
multaneous fits of all the most prominent emission lines ([On], 
Ha, [Oiii], Hj6, [Nn], [Sii]) with a model designed for the in- 
terpretation of integrated galaxy spectra (Chariot & Longhetti 
2001). We use Bayesian metallcities provided by the MPA/JHU 
groupQ 

There are systematic differences between the Bayesian 
method and method (Shi etal. 2006, 2007). The origin of 
the difference between the Bayesian metallicities and metal- 
licities have already been discussed by Yin et al. (2007). They 
found that for almost half of the sample galaxies (227 among 531 
galaxies with Tg measurements), Bayesian metallicities are over- 
estimated by a factor of about 0.34 dex on average. They pro- 
posed that the overestimates of Bayesian metallicities may be re- 
lated to the onset of secondary N enrichment in models. Another 
reason for the lower metallcities than Bayesian metallicities 
is that [O m]/i4363 emission line is biased by the very hot H ii re- 
gions in each galaxy. Thus, the global average temperature might 
be overestimated by 1000-3000K, which results in the system- 
atic underestimation of the oxygen abundance of 0.05-0.2 dex, 
as Nagao et al. (2006) proposed. 

Generally speaking, all the methods should result in the 
same value of abundance for a given galaxy. This is not the 
case in practice. It is evident that the discrepancy is caused 



4. Occam's razor meets oxygen abundance 
indicators 

4. 1 . Probability of a theory 

Let us consider a set of N methods {//, }, only one of which can be 
true, which we want to figure out. Then the probability of the k- 
th method, given the data D, is computed from Bayes' Theorem; 



P{Hk\D) 



P(Hk)A(D\Hk) 
ZiP(Hk)A{D\Hd 



The prior probabilities P(Hk) represent the investigator's de- 
gree of knowledge from previous measurements for the k-th 
method before seeing the data D. A{D\Hk) is a measure of how 
well Hit fits the data. Bayes' Theorem states that the posterior 
probability for a certain method to be true is proportional to its 
prior probability assigned before seeing the data D and the de- 
gree that Hk matches the data. 

Then our assignment translates into measuring the prior 
probabilities P{Hk) and A(D\Hit). Since prior probabilities are 
normalized over the parameter space. 



P(AuA2,-)dAidA 



2 — 



1, 



the influence of prior probabilities on the posterior probabil- 
ity diminishes. A(D\Hk, A\,A2, ...) contributes significantly to the 
posterior probability. In practice, the prior hypotheses is quite 
unimportant, especially for good data (such as SDSS data). This 
is because good data have a good chance of supporting the cor- 
rect method, even if the prior hypothesis is biased against an- 
other model. Only if the data are so bad or scant that they add 
little to our knowledge, will the posterior reflect the prior hy- 
potheses. 

For prior probabilities PiHk), we will assign equal prior 
probabilities to all methods, since we assume that we have no 
preference to any one of these methods in advance. As a result, 
to eliminate the influence of prior probabilities completely, we 
measure the mean likelihood of one method relative to another 
A,: 



(2;rcr2)2 



Z(D,) 



See |http://www.mpa-garching.mpg.de/SDSS/[ 



Here our data is a sample of Gaussian distributions with 
mean fi and variance cr^. Zi)(Di) is the standard metallcity for one 
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Table 1. The mean likelihood of one method relative to another. 
The likelihood are in unit of 10'*. 



1 500 [ 







Bayesian 


03A'2 




P 


N2 


A,' 




-10.24 


1.24 


-7.40 


0.34 


-2.33 




-2.14 




-2.41 


0.15 


-3.36 


-0.51 




0.47 


-13.03 




-9.48 


1.26 


-3.31 


A*^ 


-8.42 


0.92 


-9.43 




-10.10 


-0.95 



1: Ratio of the likelihood regarding abundance as observed abun- 
dance; 

2: Ratio of the likelihood regarding Bayesian abundance as observed 
abundance; 

3: Ratio of the likelihood regarding 03N2 abundance as observed 
abundance; 

4: Ratio of the likelihood regarding R23 abundance as observed abun- 
dance. 



1 000 



500 



10 



1 2 + log(0/H)^, 



galaxy which is a 'temporary' supposition. Z(Z),) is the metallic- 
ity derived by the ith-method. Table [T] gives the result of the 
mean likelihood of one method relative to another A^. 

The results clearly show that the likelihood of the Te, P and 
03N2 methods always have the same signs. This means that if 
any one method of the T^, P or 03N2 methods are (un)reliable, 
the other two methods will also be (un)reliable. It is a matter 
of course because the P and 03N2 methods are calibrated by 
Tf-method. The likelihood of the Bayesian and 7?23 methods 
always have same signs because both are calibrated by galaxy 
model. The likelihoods of the Bayesian method always have dif- 
ferent signs than the classic T,, method because there is a sys- 
tematic difference between the Bayesian method and method 
(see Sect l3.6l l. In either case, likelihood of the N2 method is al- 
ways negative using this dataset. This result may imply that the 
N2 method is an unreliable oxygen abundance determination 
method. An fundamental cause may lie in that the N2 method 
does not use oxygen emission line but only nitrogen emission 
line to calculate the oxygen abundance, which is obviously un- 
suitable. 



4.2. The Bayesian Information Criterion(SIC) 



Calculating probability of a theory in Sect. 14.11 does not give 
the reliability for a single method, so it is necessary to use an- 
other standard approach to test oxygen abundance determination 
methods, which is the Bayesian Information Criterion (SIC). 

Suppose we have two competing methods: f(Xi; 61 ... 
0,,,,), f{X2', (p\ ■ . ■ ipmi)- We have a random sample: X\,X2,- ■ ■ 
X„. The likelihood functions for the two competing methods 
are: L\{6\,92 ... d„\) and L2{4>\,(p2 ■■■ (pmi)- The Bayesian 
Information Criterion (SIC), which tests a hypothesis that one 
method fits the data better than the other, is defined as: 



BIC = 2lni 



On) 



) — (nil — fn2) X n. 



(6) 



L2( (pi,...,(p„) 

Then we shall calculate the MLE's (Maximum Likelihood 
Estimates) 6*,- and 0,- and then compute the estimated BIC: 



BIC = 2ln( 



Li( 0u.. 



) — (mi — m2) X n. 



(7) 



L2( <P\,---,<Pn) 

Using Eq|7] we can decide that the first method is superior 
to the second methods if it satisfies BIC > 10. According to hy- 



1 000 



800 



600 - 



400 



200 - 




12 + log(0/H)^p„ 

Fig. 1. Upper panel: The distribution for Te metallicity in our 
data sample. Lower panel: The distribution for Bayesian metal- 
licity in our data sample. 



pothesis testing in this situation, a value of 10 is strong evidence 
that the model 1 is preferred over model 2. 

Suppose our sample has a Gaussian distribution with mean 
fi and variance cr^. This is a good approximation to the real dis- 
tribution which is shown in Fig.[T] Note that the second term in 
Eq|7] always vanishes because the number of the parameters in 
our sample is 2 for all methods (yu and cr^). As a result, the degree 
of one method fitting the data for the k-th method 



{2n(T^)i 2cr^ ^ 



(8) 



represents the validity of one method. 

MLE for // is X. MLE for (p- is S^. Then Eq.[8]is changed to 



1 n-\ 
Tk - —expi — ). 



(27ro-2) 



(9) 



Table. |2]gives the result of the log likelihood for the validity 
of the Tg methods and Bayesian method. To show the behavior 
of the BIC for different types of galaxies, we calculate the like- 
lihood for each subsample in bin of concentration index (C) and 
absolute luminosity in R band (Mr). 

Under the hypothesis testing framework, the title of "favor- 
able method" is not normally bestowed unless BIC > 10. Table. 
12] shows the consistent result for each type of galaxy. The log 
likelihoods for the validity of the T^ methods are much higher 
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Table 2. The log likelihood for the validity of the methods 
and Bayesian method. 







Te 


Bayesian 


ln(L)' 


C < 2.3 


903.6 


166.5 


ln(L)' 


2.3 < C < 2.6 


1341.1 


333.8 


ln(L)' 


C > 2.6 


808.8 


212.2 


ln(L)2 


-26 <M,< -20.3 


121.0 


49.0 


ln(L)2 


-20.3 < Mr < -19.7 


151.6 


74.6 


ln(L)2 


-19.7 < M, < -14 


282.9 


92.3 



1: We use SIC method to find most reliable metallicity indicator in 
bin of C ; 

2: We use SIC method to find most reliable metallicity indicator in 
bin of M,.; 




Fig. 2. The log likelihood for the Te method and the Bayesian 
method in bins of different ratio of signal to noise. Star denotes 
the log likelihood of the subsample of S IN < 8; Cross denotes 
the log likelihood of 8 < S IN < 10; Diamond denotes the log 
likelihood of 10 < S IN < 13; Triangle denotes the log likelihood 
of S IN > 13; 



4.3. Three Information Scoring Functions: AIC, BIC and 
ICOMP 

While the previous section relies on Bayesian analysis, this sec- 
tion implements model selection based on information criteria. 
Traditional methods of model selection use hypothesis testing 
to draw some inference using two choices in a hypothesis test. 
The computed test statistic is compared with a threshold value 
to decide whether the researcher should accept or reject the hy- 
pothesis test. The preceding version of BIC actually performs a 
hypothesis test about which model is the correct one according 
to the computed likelihoods. The resulting value of BIC is then 
compared with a threshold value (we used 10) to see whether 
we should accept Model 1 or Model 2. We used this method in 
the previous section. By contrast, methods based on information 
theory do not rely on any hypothesis testing, but rather compute 
information scores to compare models. These methods try to bal- 
ance the log likelihood term (Lack of Fit) with a penalty term to 
control the complexity of the model. This philosophy is consis- 
tent with Occam's razor. The following are the modern versions 
of these information scoring functions. The most parsimonious 
balance between these competing terms will show the minimum 
information score computed by these functions. 

Information-based statistical analysis offers new ways to 
overcome problems associated with traditional methods. First 
proposed by Akaike (1973) and more fully developed by 
Bozdogan (1990, 2004), information scoring tries to find a bal- 
ance between the lack of fit in model's log likelihood and a 
penalty term that controls the complexity of the model. This 
methodology resolves ambiguities associated with hypothesis 
test-based methods because values associated with hypothesis 
tests can be near the threshold of acceptance or rejection. By 
contrast, under the information scoring methodology, the model 
that achieves the lowest information score is the best to describe 
the system under study. 

In our analysis, because the data all show univariate 
Gaussian distributions, we scored AIC, BIC and ICOMP under 
this assumption. The expression for the log likelihood is given 
by 



than the Bayesian method which makes BIC » 10. It is very 
strong evidence that Tg method is more reliable than Bayesian 
method. 

Though the decision is consistent, one should not trust the re- 
sult blindly. Considering carefully and deeply on this matter will 
puzzle us because the Bayesian method is based on a physical 
model (Chariot & Longhetti 2001) which takes into account pho- 
toionization calculations and the various stages of the evolution 
of stellar populations, but the Tg method is based on the much 
simpler model than Chariot & Longhetti (2001). Why is the 
credibility of Bayesian method not better than the Tg method? 
The result of the Bayesian method elaborated in Sect. l3.6l is not 
convincing enough to solve this problem. To resolve this prob- 
lem which cries for a solution, we plot the relationship between 
the log likelihood for the Tg method and the Bayesian method in 
Fig.|2]in bins of different ratios of signal to noise (S/N). 

Fig. |2] showes clearly that the credibility of the Bayesian 
method increases more quickly than Tg method with the increase 
of S/N. We can safely predict that the credibility of the Bayesian 
method will exceed the credibility of Tg method when the quality 
of the spectrum improves much better than SDSS. We anticipate 
that the next generation of telescopes can fulfill this mission. 



- llogLi®, Normal) = n \n{2n) + n \n{(r^) + n (10) 

where cr^ is the MLE of the variance and n is the number of data 
points in the sample. 

AIC is defined as minus 2 times the log likelihood plus 2 
times the number of free parameters: 

AICiNormal) = -2logL(&) + 2m 

= n ln(27r) -H « ln(o-^) + n + 2 X 2 ^^^"^ 

where logL{@) is the maximized log likelihood and m is the 
number of free parameters in the model. Likewise, BIC is given 
by: 

BIC(Normal) = -2logU®) + log(n)m 

, (12) 
= n ln(27r) + n ln(cr ) + n + 2log(n) 

ICOMP is defined as: 

ICOMP(Normal) = -2logL(@) + 2Ci (2f ) 

, (13) 
= «ln(27r) + «ln(cr2)-H«-H2Ci(ZF) 
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Table 3. AIC, BIC, ICOMP for the T, method and the Bayesian 
method. The UkeUhood are in units of 10"^. 







Bayesian 


AIC 
BIC 
ICOMP 


-8.511 
-8.499 
-8.512 


-4.171 
-4.159 
-4.173 



Instead of directly penalizing the number of free param- 
eters, ICOMP penaHzes the complexity of the Inverse Fisher 
Information Matrix. ICOMP is considered to be the most modern 
and consistent information scoring function (Bozdogan 2004). 
The Fisher Information Matrix for a univariate Gaussian distri- 
bution is given by: 



A 

tr- 

— 



(14) 



The complexity component of ICOMP is: 



Ci(2.') 



2 ^ 



q 



^log[det(2^i) 



(15) 



Here, q - rank(l,f). 

Using these expressions, we can compute the respective in- 
formation scores for the different oxygen abundances. 

In table [3] we give the result of these three scoring func- 
tions for the Tf method and the Bayesian method. It shows the 
consistent result with table |2] and table [T] that the method is 
more reliable than the Bayesian method using the current data 
sample. The three scoring functions (AIC, BIC, ICOMP) agree 
with each other and all confirm that, according to the sample of 
computed oxygen abundances, the method fits the data better 
than the Bayesian method. 

5. Conclusions 

We have presented a large sample of spectroscopic measure- 
ments of H II galaxies from SDSS DR4 covering a wide range of 
metallicites (7.5 < 12 + log(0/H) < 9.0). We have determined 
oxygen abundances for the sample, using different oxygen abun- 
dance indicators. We have studied the credibility of the different 
oxygen abundance indicators and obtained the following results. 

1. Tg- method is the more reliable oxygen abundance determi- 
nation methods than Bayesian metallcity under the existing 
telescope ability. We predict Bayesian metallcity will grad- 
ually become the most reliable oxygen abundance determi- 
nation method when the S/N of the spectrum is much higher 
than nowadays. 

2. Tc-, P-, (93A^2-method are consistent with each other 
because P-, (93A^2-method are calibrated by T^-method. 
Bayesian, /?23- method are consistent with each other be- 
cause both are calibrated by a galaxy model. 

3. In either case, the N2 method is an unreliable method. 
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